DWT-based classification of acoustic-phonetic classes and phonetic units

نویسندگان

Gernot Kubin

Tuan Van Pham

چکیده

In this paper, we describe a new algorithm based on the discrete wavelet transform (DWT) which uses a multithreshold decision model (MTD model) to detect acoustic and phonetic classes (based on 10ms speech signal segments). The best thresholds of the model are found by using experimental pattern classification. Then a unit level interpolation technique is combined with the MTD model to classify phonetic units (based on sequences of 10ms segments). The results of the classifiers are compared and jointly adjusted by an interactive scheme (IS) in order to improve the performance of the algorithm. The algorithm is tested with the TIMIT database and compared with the SUB-CRA-based algorithm and other algorithms to demonstrate its effectiveness.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An approach to obtain weighted graphs of words based on phoneme detection

In this paper, we present an approach for phoneme detection and phonetic classification that can be used as a basis for different speech processes, such as phoneme boundary detection, acoustic-phonetic decoding or word-graph construction with acoustic confidence scores. The phonetic classifier that has been developed is based on a phase of acoustic vector clustering in the space of acoustic cha...

متن کامل

Significance of group delay based acoustic features in the linguistic search space for robust speech recognition

In this paper we discuss the complementarity of the group delay features with respect to other conventional acoustic features and also propose the use of such diverse information in the linguistic search space for robust speech recognition. A discriminability analysis is carried out on various classes of phonetic units. A class based phonetic unit analysis is conducted to compare the suitabilit...

متن کامل

Knowledge based approach to consonant recognition

This paper presents a knowledge based approach to consonant recognition. In traditional knowledge based systems, the expert is the linguist/phonetician who attempts to describe and quantify the acoustic events, in the form of production rules into phonetic description. This paper proposes to alter the expert's role so that the expert only needs to provide the basic structure of the phonetic cla...

متن کامل

Using Chi-Square Testing in Modeling Confusion Characteristics for Robust Phonetic Set Generation

A phonetic representation of a language is used to describe the corresponding pronunciation and synthesize the acoustic model of any vocabulary. In order to obtain better phonetic representation, context-dependent units are used to model co-articulation effects between phones and have been broadly in speech recognition. However, this representation generally increases the number of recognition ...

متن کامل

Syllable structure based phonetic units for context-dependent continuous Thai speech recognition

Choice of the phonetic units speech recognizer is a factor greatly affecting the system performance. Phonetic units are normally defined according to the acoustic properties of a speech. Nevertheless, with the limit of training data, too delicate acoustic properties are ignored. Syllable structure is one of the properties usually ignored in English phonetic units due to a lot of possible onsets...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2004

DWT-based classification of acoustic-phonetic classes and phonetic units

نویسندگان

چکیده

منابع مشابه

An approach to obtain weighted graphs of words based on phoneme detection

Significance of group delay based acoustic features in the linguistic search space for robust speech recognition

Knowledge based approach to consonant recognition

Using Chi-Square Testing in Modeling Confusion Characteristics for Robust Phonetic Set Generation

Syllable structure based phonetic units for context-dependent continuous Thai speech recognition

عنوان ژورنال:

اشتراک گذاری